Sinusoidal Extraction Using an Efficient Implementation of a Multi-resolution Fft
نویسنده
چکیده
This paper provides a detailed description of the spectral analysis front-end of a melody extraction algorithm. Our particular approach aims at extracting the sinusoidal components from the audio signal. It includes a novel technique for the efficient computation of STFT spectra in different time-frequency resolutions. Furthermore, we exploit the application of local sinusoidality criteria, in order to detect stable sinusoids in individual FFT frames. The evaluation results show that a multi resolution analysis improves the sinusoidal extraction in polyphonic audio.
منابع مشابه
An Efficient Multi-Resolution Spectral Transform for Music Analysis
In this paper we focus on multi-resolution spectral analysis algorithms for music signals based on the FFT. Two previously devised efficient algorithms (efficient constantQ transform [1] and multiresolution FFT [2]) are reviewed and compared with a new proposal based on the IIR filtering of the FFT. Apart from its simplicity, the proposed method shows to be a good compromise between design flex...
متن کاملSpeech analysis and coding using a multi-resolution sinusoidal transform
The sinusoidal transform, as developed by Quatieri and McAulay, provides a sparse representation for speech signals by taking advantage of psychoacoustic masking. The currently reported work takes the sinusoidal transform one step further by considering the frequency resolution abilities of the human auditory system in more detail. The new transform is based on the wavelet principle of variable...
متن کاملParallelization of Rich Models for Steganalysis of Digital Images using a CUDA-based Approach
There are several different methods to make an efficient strategy for steganalysis of digital images. A very powerful method in this area is rich model consisting of a large number of diverse sub-models in both spatial and transform domain that should be utilized. However, the extraction of a various types of features from an image is so time consuming in some steps, especially for training pha...
متن کاملSurvey on Extraction of Sinusoids in Stationary Sounds
This paper makes a survey of the numerous analysis methods proposed in order to extract the frequency, amplitude, and phase of sinusoidal components from stationary sounds, which is of great interest for spectral modeling, digital audio effects, or pitch tracking for instance. We consider different methods that improve the frequency resolution of a plain FFT. We compare the accuracies in freque...
متن کاملFFTC: Fastest Fourier Transform for the IBM Cell Broadband Engine
The Sony-Toshiba-IBM Cell Broadband Engine is a heterogeneous multicore chip architectured for intensive gaming applications and high performance computing. It consists of a traditional microprocessor (called the PPE) that controls eight SIMD co-processing units called synergistic processor elements (SPEs). We exploit the architectural features of the Cell processor to design an efficient paral...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006